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Abstract — Based on the ideas of cyclotomic cosets, idempotents 
and Mattson-Solomon polynomials, we present a new method 
to construct GF(2 7Tl ), where m > cyclic low-density parity- 
check codes. The construction method produces the dual code 
idempotent which is used to define the parity-check matrix 
of the low-density parity-check code. An interesting feature of 
this construction method is the ability to increment the code 
dimension by adding more idempotents and so steadily decrease 
the sparseness of the parity-check matrix. We show that the 
constructed codes can achieve performance very close to the 
sphere-packing-bound constrained for binary transmission. 

Keywords — Coding, idempotent, non binary LDPC, Mattson- 
Solomon polynomial 



I. Introduction 

Since the recent rediscovery of low-density parity-check 
(LDPC) codes, a great deal of effort has been devoted to 
constructing LDPC codes that can work well with the belief- 
propagation iterative decoder. The studies of long block-length 
LDPC codes are very much established. The recent works 
of [1], [2] have shown that, for long block-lengths, the best 
performing LDPC codes are irregular codes and these codes 
can outperform turbo codes of the same block-length and 
code-rate. These long LDPC codes have degree distributions 
which are derived from differential evolution [1] or Gaussian 
Approximation [3]. It can be shown that, using the concen- 
tration theorem [4], the performance of infinitely long LDPC 
codes of a given degree distribution can be characterised by 
the average performance of the ensemble based on cycle- 
free assumption. This assumption, however, does not work 
for short and moderate block-length LDPC codes due to the 
inevitable existence of cycles in the underlying Tanner Graphs. 
Consequently, for a given degree distribution, the performance 
of short block-length LDPC codes varies considerably from 
the ensemble performance. Various methods exist for the 
construction of finite block-length irregular codes [5], [6], [7]. 
In addition to irregular LDPC codes, algebraic constructions 
exist and the resulting codes are regular and usually cyclic 
in nature. Some examples of algebraic LDPC codes are the 
Euclidean and Projective Geometry codes [8]. 

It has been noticed by the authors that, in general, there is a 
performance association between the code minimum distance 
(dmin) and decoding convergence. The irregular LDPC codes 
converge very well with iterative decoding, but their d m i n 



are reasonably low. On the other hand, the algebraically con- 
structed LDPC codes, which have high d m i n , tend not to con- 
verge well with the iterative decoder. It is not surprising that 
algebraically constructed codes may outperform the irregular 
codes. The latter have error-floor which is caused by the d m i n 
error-events. On the encoding side, the existence of algebraic 
structure in the codes is of benefit. Rather than depending 
on the parity-check or generator matrices for encoding, as in 
the case of irregular codes, a low-complexity encoder can be 
built for the algebraic LDPC codes. One such example is the 
linear shift-register encoder for cyclic LDPC codes. Assuming 
that n and k denote the codeword and information length 
respectively, algebraic codes that are cyclic offer another 
decoding advantage. The iterative decoder has n parity-check 
equations to iterate with instead of n — k equations, as in the 
case of non-cyclic LDPC codes, and this leads to improved 
performance. 

It has been shown that the performance of LDPC codes can 
be improved by going beyond the binary field [9], [10]. Hu 
et al. showed that, under iterative decoding, the non binary 
LDPC codes have better convergence properties than the 
binary codes [10]. They also demonstrated that a coding gain 
of 0.25dB is achieved by moving from GF(2) to GF(2 6 ). Non 
binary LDPC codes in which each symbol takes values from 
GF(2 m ) offer an attractive scheme for higher-order modula- 
tion. The complexity of the symbol-based iterative decoder can 
be simplified as the extrinsic information from the component 
codes can be evaluated using the frequency domain dual codes 
decoder based on the Fast-Walsh-Hadamard transform. 

Based on the pioneering works of Mac Williams [11], [12] 
on the idempotents and the Mattson-Solomon polynomials, 
we present a generalised construction method for algebraic 
GF(2 m ) codes that are applicable as LDPC codes. The 
construction for binary codes using idempotents has been 
investigated by Shibuya and Sakaniwa [13], however, their 
investigation was mainly focused on half-rate codes. In this 
paper, we construct some higher code-rate non binary LDPC 
codes with good convergence properties. We focus on the 
design of short block-length LDPC codes in view of the 
benefits for thin data-storage, wireless, command/control data 
reporting and watermarking applications. One of the desirable 
features in any code construction technique is an effective 
method of determining the d m , n and this feature is not present 
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in irregular code construction methods. With our idempotent- 
based method, the d m i n of a constructed code can be easily 
lower-bounded using the well-known BCH bound. 

The rest of the paper is organised as follows. In Section ITT1 
we briefly review the theory of the cyclotomic cosets, idempo- 
tents and Mattson-Solomon polynomials. Based on the theory, 
we devise a generalised construction algorithm and present 
an example in Section EH] We also outline an efficient and 
systematic algorithm to search for algebraic LDPC codes in 
Section m] In Section HV1 we demonstrate the performance of 
the constructed codes by means of simulation and Section fVl 
concludes this paper. 

II. Cyclotomic Cosets, Idempotents and 
Mattson-Solomon Polynomials 

We briefly review the theory of cyclotomic cosets, idempo- 
tents and Mattson-Solomon polynomials to make this paper 
relatively self-contained. Let us first introduce some notations 
that will be used throughout this paper. Let m and m' be 
positive integers with m\m! , so that GF(2 m ) is a subfield of 
GF(2 m '). Let n be a positive odd integer and GF(2 m ') be the 
splitting field for 1 + x n over GF(2 m ), so that n\2 m ' - 1. Let 
r = (2 m ' - l)/n, I = (2 m ' - l)/(2 m - 1), a be a generator 
for GF(2 m ') and be a generator for GF(2 m ), where = a l . 
Let T a (x) be the set of polynomials of degree at most n — 1 
with coefficients in GF(2 a ). 

Definition 2.1: If a(x) G T m /(x), then the finite-field trans- 
form of a(x) is: 

n-l 

A(z) = MS (o(s)) = ^ a(a- rj )z j (1) 

where A(z) 6 T m >(z). This transform is widely known as the 
Mattson-Solomon polynomial. The inverse transform is: 

n—l 

a(x) = MS' 1 (A{z)) = -J2 A{a rl )x l (2) 
n i=0 

Definition 2.2: Consider e(x) 6 T m (x), e{x) is an idem- 
potent if the property of e(x) — e(x) 2 mod (1 + x n ) is 
satisfied. In the case of m = 1, the property of e(x) = 
e(x 2 ) mod (1 + x n ) is also satisfied. 

An (n, k) cyclic code C can be described by the generator 
polynomial g(x) € T m (x) of degree n—k and the parity-check 
polynomials h(x) € T m (x) of degree k such that g(x)h(x) = 
1 + x n . It is widely known that idempotents can be used to 
generate C. Any GF(2 m ) cyclic code can also be described by 
a unique idempotent e g {x) G T m (x) which consists of a sum 
of primitive idempotents. This unique idempotent is known as 
the generating idempotent and, as the name implies, g(x) is 
a divisor of this idempotent, i.e. e g (x) — m(x)g(x), where 
m(x) contains the repeated factors or non-factors of 1 + x n . 

Lemma 2.1: If e{x) G T m (x) is an idempotent, E(z) — 
MS(e(x)) G Ti(z). 

Proof: (cf. [11, Ch 8]) Since e(x) = e(x) 2 mod (1 + 
x n ), from equation [l] it follows that e(a~ rj ) = e(a~ rj ) 2 , 
G {0, 1, . . . , n — 1} for some integers r and I. Clearly, 
e(a~ r3 ) G {0, 1} implying that E(z) is a binary polynomial. 



Definition 2.3: If s is a positive integer, the binary cyclo- 
tomic coset of s mod n is: 

C s = {2\s mod n | < i < t] , 

where we shall always assume that the subscript, s, is the 
smallest element in the set C s , and t is the smallest positive 
integer with the property that 2 t+1 s = s mod n. If N is the set 
consisting of the smallest elements of all possible cyclotomic 
cosets then 

C= UC s = {0,l,2,..,n-l}. 

Lemma 2.2: Let s 6 JV and let C s j represents the ith 
element of C s . Let the polynomial e s (x) G T m (x) be given 
by 

e s (x) = ^2 ec 3:i x c °<\ (3) 

0<i<|C s |-l 

where \C S \ is the number of elements in C s and ec B ( is defined 
below. 

i) if m = 1, ec s 4 = 1, 

ii) if m > 1, ec s 4 is defined recursively as follows: 

forz = 0, ec s „ G {1, 0, 2 , . . . , /3 2 " 1 - 2 }, 
for i > 0, ec Sii = e 2 ^ 

The polynomial so defined, e s (x), is an idempotent. We term 
e s (x) a cyclotomic idempotent. 

Definition 2.4: Let M C J\f and let u{x) G T m (x) be 

u(x) = e s (x). (4) 

Then (refer to lemma l2~2l u{x) is an idempotent and we call 
u(x) a parity-check idempotent. 

The parity-check idempotent u{x) can be used to describe 
the code C, the parity check matrix being made up of the n 
cyclic shifts of the polynomial x ie& ( utyX ^u(x- v ). 

If (u(x), 1 + x n ) — h(x) 1 then, in general, wt(u(x)) is 
much lower than wt(h(x)) 2 . Based on this observation and the 
fact that u(x) contains all the roots of h(x), we can construct 
cyclic codes that have a low-density parity-check matrix. 

Definition 2.5: Let the polynomial f{x) G T\{x). The 
difference enumerator of f(x), denoted as V(f(x)), is defined 
as follows: 

V(f(x)) = f(x)f(x- 1 ) = d + d lX + ... + d n ^x n -\ (5) 

where we assume that T>(f(x)) is a modulo 1— x n polynomial 
with real coefficients. 

Lemma 2.3: Let m = 1 and let di for < i < n — 1 
denote the coefficients of T>(u(x)). If di G {0, 1}, Vi G 
{1, 2, . . . , n — 1}, the parity-check polynomial derived from 
u{x) is orthogonal on each position in the n-tuple. Conse- 
quently (i) the dmin of the resulting C is 1 + wt(u(x)) and 
(ii) the underlying Tanner Graph has girth of at least 6. 

Proof: (i) (cf. [14, Theorem 10.1]) Let a codeword 
c(x) = Co + c\x + . . . + Cn^ix 11 - 1 and c{x) G T\{x). For 
each non zero bit position cj of c(x) where j G {0, 1, . . . , n — 

1 (a, b) denotes the greatest common divisor of a and b 
2 wt(f(x)) denotes the weight of polynomial f(x). 
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1}, there are wt{u{x)) parity-check equations orthogonal to 
position Cj, Each of the parity-check equation must check 
another non zero bit q I ^ j so that the equation is satisfied. 
Clearly, wt(c(x)) must equal to 1 + wt{u{x)) and this is the 
minimum weight of all codewords. (ii) The direct consequence 
of having orthogonal parity-check equation is the absence of 
cycles of length 4 in the Tanner Graphs. It can be shown that 
there exists three integers a, b and c, such that 2(6— a) = (c— b) 
for a < b < c. If these three integers are associated to the 
variable nodes in the Tanner Graphs, a cycle of length 6 can be 
formed between these variable nodes and some check nodes. 

■ 

From Lemma l2~3l we can deduce that u(x) is the parity- 
check polynomial for One-Step Majority-Logic Decodable 
codes if di € {0, 1}, Vi G {1, 2, . . . , n — 1} or the parity- 
check polynomial for Difference-Set Cyclic codes if di = 1, 
Vie{l,2,...,n-1}. 

Lemma 2.4: For the non binary GF(2 m ) cyclic codes, the 
d m i n is bounded by: 

do < d m in < min (wt(g(x)), 1 + wt(u(x))) 

where d denotes the maximum run of consecutive ones in 
U(z) taken cyclically modulo n. 

Proof: The lower-bound of the d m i n of a cyclic code, 
BCH bound is determined from the number of consecutive 
roots of e g {x) and from lemma |2~T1 it is equivalent to the run 
of consecutive ones in U(z). ■ 

III. Construction Algorithm for the Codes 

Based on the mathematical theories outlined above, we 
devise an algorithm to construct GF(2 m ) C which are applica- 
ble for iterative decoding. The construction algorithm can be 
described in the following procedures: 

1) Given the integers m and n, find the splitting field 
(GF(2 m ')) of 1 + x n over GF(2 m ). We can only con- 
struct GF(2 m ) cyclic codes of length n if and only if 
the condition of m\m' is satisfied. 

2) Generate the cyclotomic cosets modulo 2 rn — 1 and 
denote it C. 

3) Derive a polynomial p(x) from C. Let s G Af be 
the smallest positive integer such that \C' S \ = m. The 
polynomial p(x) is the minimal polynomial of a s : 

p(x)= 11 (x + a c '^ (6) 

0<i<m 

Construct all elements of GF(2 m ) using p(x) as the 
primitive polynomial. 

4) Let C be the cyclotomic cosets modulo n and Af be 
a set containing the smallest number in each coset of 
C. Assume that there exists a non empty set AA C 
Af and following definition 12.41 construct the parity- 
check idempotent u(x). The coefficients of u(x) can be 
assigned following lemma l2~2l 

5) Generate the parity-check matrix of C using the n cyclic 
shifts of x^^uix- 1 ). 

6) Compute r and I, then take the Mattson-Solomon poly- 
nomial of u(x) to produce U(z). Obtain the code 
dimension and the lower-bound of the d m i n from U(z). 



Note that care should be taken to ensure that there is no 
common factor between n and all of the exponents of u{x), 
apart from unity, in order to avoid a degenerate code. 

Example 3.1: Let us assume that we want to construct a 
GF(64) n = 21 cyclic idempotent code. The splitting field 
for 1 + x 21 over GF(64) is GF(64) and this implies that 
m = m' = 6, r = 3 and I = 1. Let C and C' denote 
the cyclotomic cosets modulo n and 2 m — 1 respectively. 
\C[\ — 6 and therefore the primitive polynomial p(x) has 
roots of a? Vj G C[, i.e. p(x) = 1 + x + x 6 . By letting 
1 + fJ + f3 6 — 0, all of the elements of GF(64) can be defined. 
If we let u(x) be the parity-check idempotent generated by 
the sum of the cyclotomic idempotents defined by C s where 
s G {A4 : 5,7,9} and e Ca>0 , Vs e M be /3 23 , 1 and 1 
respectively, u(x) = f3 23 x 5 + x 7 + x 9 + P 46 x w + /3 43 a; 13 + 

X U + x 15 + ^53^7 + X IS + ^58^19 + ^29^20 an( j hs MattSOn- 

Solomon polynomial U(z) tells us that it is GF(64)(21, 15) 
cyclic code with d m i n >— 5. 

A systematic algorithm has been developed to sum up all 
combinations of the cyclotomic idempotents to search for all 
possible GF(2 m ) cyclic codes of a given length. The search 
algorithm is targeted on the following key parameters: 

1) Sparseness of the resulting parity-check matrix. Since 
the parity-check matrix of C is directly derived from 
u(x) which consists of the sum of the cyclotomic idem- 
potents, we are only interested in low-weight cyclotomic 
idempotents. Let us define W max as the maximum 
wt(u(x)) then the search algorithm will only choose 
the cyclotomic idempotents whose sum has total weight 
less than or equal to W max . 

2) High code-rate. The number of roots of u(x) which are 
also roots of unity define the dimension of C and let 
us define k m i n as the minimum information length of 
C. We are only interested in the sum of the cyclotomic 
idempotents whose Mattson-Solomon polynomial has at 
least k m i n zeros. 

3) High d m i n . Let us define d as the minimum value of 
the d m in of C. The sum of the cyclotomic idempotents 
should have at least d— 1 consecutive powers of (3 which 
are roots of unity but not roots of u(x). 

The search algorithm can be relaxed to allow the existence of 
cycles of length 4 in the resulting parity-check matrix of C. 
The condition of cycles-of-length-4 is not crucial as we will 
show later that there are codes that have good convergence 
properties when decoded using iterative decoder. Clearly, by 
eliminating the cycles-of-length-4 constraint, we can construct 
more codes. 

Following definitions 12.11 and 12.41 

U(z)=Msl e s (x)) = E s{*) 
\seM ) seM 

and hence it is possible to maximise the run of the consecutive 
ones in U(z) if the coefficients of e s (x) are aligned appro- 
priately. It is therefore important that all possible non zero 
values of ec s , Vs G A4 are included in the search in order to 
guarantee that we can obtain codes with the highest possible 
dmin or at least to obtain a better estimate of the d m in- 
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IV. Code Performance 

As an example of the performance attainable from an 
iterative decoder, computer simulations have been carried out 
for several GF(2 m ) cyclic LDPC codes. We assume BPSK 
signalling and the iterative decoder used is the modified belief- 
propagation decoder which approximates the performance of 
a maximum-likelihood decoder [15], [16]. The frame-error- 
rate (FER) performance of the GF(2 6 )(21, 15) cyclic LDPC 
code is shown in Fig. ^ and is compared with the sphere- 
packing-bound [17], [18] for binary codes of length 126 bits 
offset by the binary transmission loss 3 . We can see that the 
performance of the code is within 0.2dB away from this 
bound at 10~ 3 FER. The binary level minimum-distance of 
this GF(64)(21, 15) cyclic LDPC code is 9. 
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Fig. 1. Frame error performance of the GF(2 6 )(21, 15) cyclic LDPC code 



3 In the rest of this paper, we assume that the sphere-packing-bound has been 
offset by the information theoretical loss associated with binary transmission. 
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Fig. 2. Frame error performance of the GF(2 2 )(255, 175) cyclic LDPC 
code 



Fig.|2]shows the FER curve of the GF(2 2 )(255, 175) cyclic 
LDPC code which is equivalent to (510, 350) binary code. 
At 10~ 3 FER, the performance of this code is approximately 
0.36dB away from the sphere-packing-bound of length 510 
bits. While both of the codes mentioned above are free from 
cycles of length 4, good convergence codes exist even if they 
have cycles of length 4 in the underlying Tanner Graph. One 
such example is the GF(2 3 )(91, 63) cyclic code whose FER 
performance is shown in Fig. [3] At 10~ 3 FER, the code 
performs around 0.35dB away from the sphere-packing-bound 
of length 273 bits. The parameters of the codes in Fig.[2]and[3] 
are available in TableU Some other examples of the non binary 
GF(2 m ) cyclic LDPC codes with their parameters and distance 
from the sphere-packing-bound are also shown in Table U 



TABLE I 
Code examples 



c 


u(x) 




< 


Comment 


SPBt 


GF(4) (51, 29) 


P 2 x 3 + /3x 6 + /3 2 x 12 ,x 17 + /3x 24 + 0x 27 + x 34 + 
P 2 x 39 + Px 45 + p 2 x 48 


5 


10 


m = 2, vn! = 8, r = 5 and I = 85 


0.25dB 


GF(4)(255, 175) 


Px 7 + /3 2 x 14 + /3x 28 + /3 2 x 56 + x 111 + px 112 + 

x 123 + 02J.131 + x 183 + 3-189 + ^193 + x 219 + 
x 222 + ^224 + ^237 + ^246 


> 17 


20 


m = 2, m' = 8, r = 1 and £ = 85 


0.36dB 


GF(4)(273, 191) 


P 2 x 23 + /3x 37 + /3x 46 + P 2 x 74 + /3x 91 + /3 2 x 92 + 
f3 2 x 95 +f3 2 x 107 +x 117 +Px 14S +l3 2 x 155 +/3 2 x la2 + 
(3x 1&4 + (3x 190 + x 195 + f3x 214 + x 234 


> 18 


20 


m = 2, m' = 12, r = 15 and I = 1365 


0.4dB 


GF(8)(63,40) 


l+/3 5 x 9 + /3x 13 +/3 3 x 18 + /3 2 x 19 +/3 2 x 26 + /3 6 x 36 + 
f3 4 x 3S + Px 41 + (3 4 x 52 


> 6 


10 


m = 3, w! = 6, r = 1 and / = 9 


0.3dB 


GF(8) (63, 43) 


/3 2 x 9 +f3 3 x 11 +(3 4 x ls +x 21 +f3 6 x 22 +p 3 x 25 +x 27 + 
Px 36 + /3 5 x 37 + x 42 + p 5 x 44 + x 45 + /3 6 x 50 + x 54 


> 8 


12 


m = 3, ml = 6, r = 1 and 1 = 9 


0.45dB 


GF(8)(91,63) 


/3 6 x+/3 5 x 2 +/3 3 x 4 +/3 6 x 8 +/3x 13 +/3 5 x 16 +/3 5 x 23 + 
P 2 x 26 +p 3 x 32 + P'°x 37 + P 3 x 46 +P 4 x 52 +/3 6 x 57 + 
P G x 64 + P 3 x 74 


> 8 


10 


m = 3, m' = 12, r = 45 and I = 585 


0.35dB 


GF(32)(31,20) 


1 + ^S^S + + p2S x W +x U +x n +(5 lA x W + 
pW x 20 + x 21 + x 22 + x 26 


> 7 


12 


m = 5, vn! = 5, r = I and I = 1 


0.4dB 


GF(32)(31,21) 


/3 23 x 5 +/3 29 x 9 +/3 15 x 10 + /3x 11 +/3 4 x 13 + /3 27 x 18 + 

(3 3 x 20 + plS^l + ^2^.22 + ^8^.26 


> 4 


8 


m = 5, m! = 5, r = 1 and 1 = 1 


0.25dB 



The code minimum distance in binary level. 

Distance to the sphere -packing-bound constrained for binary transmission. 
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Fig. 3. Frame error performance of the GF(2 3 )(91, 63) cyclic LDPC code 

V. Conclusions 

An algebraic construction technique for GF(2 m ) (m > 0) 
LDPC codes based on summing the cyclotomic idempotents to 
define the parity-check polynomial is able to produce a large 
number of cyclic codes. The fact that we consider step-by- 
step summation of the cyclotomic idempotents, we are able to 
control the sparseness of the resulting parity-check matrix. The 
lower-bound of the d m i n and the dimension of the codes can 
be easily determined from the Mattson-Solomon polynomial 
of the resulting idempotent. For GF(2) case where the parity- 
check polynomials are orthogonal on each bit position, we can 
even determine the true d min of the codes regardless of the 
code length. In fact, this special class of binary cyclic codes 
are the Difference-Set Cyclic and the One-Step Majority-Logic 
Decodable codes which can be easily constructed using our 
method. For non-binary cases, if the constructed code has low 
dmin, we can concatenate this code with an inner binary code 
to trade improvement in d m in with loss in code-rate. 

Simulation results have shown that these codes can converge 
well under iterative decoding and their performance is very 
close to the sphere-packing -bound of binary codes for the same 
code length and rate. The excellent performance of these codes 
coupled with their low-complexity encoder offers an attractive 
coding scheme for applications that required short block- 
lengths such as thin data-storage, wireless, command/control 
data reporting and watermarking. 
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